A REFINEMENT OF CAYLEY'S FORMULA FOR TREES 



IRA M. GESSEL* AND SEUNGHYUN SEO 

Abstract. A proper vertex of a rooted tree with totally ordered vertices is a vertex 
that is less than all its proper descendants. We count several kinds of labeled rooted 
trees and forests by the number of proper vertices. Our results are all expressed in terms 
of the polynomials 



P n (a, b,c) = c (ia + (n — i)b + c), 



which reduce to to (n+ 1)" 1 for a = b = c = 1. 

Our study of proper vertices was motivated by Postnikov's hook length formula 

T v v w 

where the sum is over all unlabeled binary trees T on n vertices, the product is over all 
vertices v of T, and h(v) is the number of descendants of v. Our results give analogues 
of Postnikov's formula for other types of trees, and we also find an interpretation of the 
polynomials P„(a, 6, c) in terms of parking functions. 



1. Introduction 

Cayley [2j showed that there are n n ~ 2 (unrooted) trees on n vertices. Equivalently, 
there are (n + l) n ~ l forests of rooted labeled trees on n vertices. In this paper we study 
the homogeneous polynomials 

n-1 

P n {a,b,c) =c\\{ia+{n-i)b + c). (1.1) 

i=i 

which reduce to (n + l)™ -1 for a = b = c = 1. We refine Cayley' s formula by showing 
that P n (a,b,c) counts rooted forests by the number of trees and the number of proper 
vertices, which are vertices that are less than all of their proper descendants. Moreover, 
other evaluations of P n (a,b,c) have similar interpretations for other types of trees and 
forests: /c-ary trees, forests of ordered trees, and forests of £;-colored ordered trees (which 
reduce to ordered trees for k = 1). We also give an interpretation of P n (a,b,c) in terms 
of parking functions. 

This work developed from a problem posed by Alexander Postnikov at the Richard 
Stanley 60th Birthday Conference. He gave the following identity, which he had derived 
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indirectly [TTJ [T2] and asked for a direct proof: 

(» + r'^EIl(i+^). a*) 

where the sum is over all unlabeled (incomplete) binary trees T on n vertices, the product 
is over all vertices v of T, and /i(t> ) is the hook length of v, which is the number of 
descendants of v, including v. 

Direct combinatorial proofs of Postnikov's identity were given by Seo |14j . Du and Liu 
[5], and Chen and Yang [3]. Seo's approach involved counting binary trees by proper 
vertices in the following way: We rewrite (|1.2j) as 

2 >+ir 1 = -!En( 1 + /^) ) (i-3) 

As we will see in Section |21 (see also ^1]), a straightforward argument shows that the 
right side of (jl.3j) is equal to YIb 2 propB where the sum is over all labeled binary trees B 
on [n] = {1,2, ... ,n} and prop B is the number of proper vertices of B. This observation 
led us to consider the polynomial ^ B t propB , which was shown in to be equal to 
P n {2, t, t). Extending this result to other kinds of forests and trees by proper vertices, and 
we found, surprisingly, that all of the results can be expressed in terms of the polynomials 
P n (a, b, c). Moreover, each type of forest or tree has a hook length formula analogous to 
Postnikov's. (Some of these hook length formulas have been found earlier by Du and Liu 

00 

A closely related statistic to the number of proper vertices is the number of proper edges, 
studied by Shor [T3], Zeng pOj, and Chen and Guo jSj, where an edge {i, j} is proper if 
i is the parent of j and i is less than all the descendants of j. Despite the similarity in 
definitions, the distribution of proper edges in trees is very different from that of proper 
vertices. 

In Section 2, we prove some lemmas relating proper vertices of forests to hook length 
formulas. Sections 3 and 4 contain background information on exponential generating 
functions and the enumeration of trees and forests. Some properties of the polynomials 
P n (a, h, c), including a differential equation satisfied by its generating function, are proved 
in Section 5. The next three sections contain our main results on the enumeration by 
proper vertices of forests of rooted trees, /c-ary trees, and forests of ordered trees, with 
corresponding hook length formulas. In Section 9 we discuss another interpretation of 
P n (a,b,c), due to Egecioglu and Remmel in terms of ascents and descents in forests, 
and in Section 10 we give a parking function interpretation to a specialization of P n (a, b, c). 

2. Hook length formulas 

Throughout this paper, all trees are rooted, and all forests are forests of rooted trees. 
An ordered tree is a tree in which the children of each vertex are linearly ordered. An 
ordered forest is a forest of ordered trees in which the trees are also linearly ordered. We 
write V(F) for the set of vertices of the forest F. 
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A vertex v in a tree is a descendant of a vertex u if u lies on the unique path from the 
root to v. If v is a descendant of u, it is a proper descendant if it is not equal to u. 

Let F be a forest with n vertices. Then a labeling of F is a bijection from V(F) to [n]. 
A vertex of F is proper with respect to a labeling of F if its label is less than the labels 
of all its proper descendants. The hook length h(v) of a vertex v in a forest is the number 
of descendants of v, including v. 

We will consider various kinds of trees and forests in this paper, and for each of them 
we will have a "hook length formula" something like (jl.Hj) . These hook length formulas 
follow from our results on counting forests by proper vertices and the following lemmas, 
which relates hook lengths to proper vertices. 

Lemma 2.1. Let F be a forest with n vertices, and let S be a subset of V(F). The the 
number of labelings of F in which every vertex in S is proper is 



Proof. If all labelings of F are equally likely, then the probability that a vertex v is proper 
is l/h(v). It can be shown that these probabilities are independent, and this implies the 
lemma. □ 



Lemma 2.2. Let F be a forest with n vertices. Then 



L veV(F) v v ;/ 

where L runs over all labelings of F. 

Proof. The left side of (j2.1j) is equal to s a' 5 ' where L runs over all labelings of F and 
S runs over all subsets of the set of vertices of F that are proper with respect to L. The 
right side of ()2.H) is equal to 

n\ 



SQV{F) IUs Hv, 

The result then follows from Lemma f2. II □ 

Let us say that two labelings of a forest F are isomorphic if there is an automorphism 
of F that takes one labeling to the other. (An automorphism of an ordered forest must 
preserve the order structure, so ordered forests have no nontrivial automorphisms.) Let F 
be a forest on n vertices with automorphism group G. Then the n\ labelings of F fall into 
n!/|G| isomorphism classes. It is clear that isomorphic labelings have the same number 
of proper vertices, so we obtain the following lemma by dividing Lemma [2.21 bv 

Lemma 2.3. Let F be a forest with n vertices and let £ be a set of representatives of 
isomorphism classes of labelings of F . Then 

Lec veV(F) v y ,J 
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Let us now define a labeled forest to be a forest whose vertex set is [n] for some n. Thus 
replacing each vertex of a labeling of an arbitrary forest gives a labeled forest, and two 
labelings give the same labeled forest if and only if the labelings are isomorphic. 

Then (J2.2J) . together with linearity, gives the following result. 

Lemma 2.4. Let T be a class of labeled forests with vertex set [n] in which membership 
depends only on isomorphism class. Then 



For ordered forests, which have no nontrivial automorphisms, we can simplify this 
result, since each isomorphism class of ordered forests on n vertices corresponds to n\ 
labeled ordered forests. An unlabeled forest may be formally defined as an isomorphism 
class of forests, though we think of an unlabeled forest as a forest in which the identity 
of the vertices is irrelevant. Then the following lemma follows from either Lemma f2. 31 or 
Lemma 12.41 

Lemma 2.5. Let A be a set of unlabeled ordered forests with n vertices, and let T be the 
corresponding set of labeled ordered forests. Then 



where the product is over all vertices v of F. 

In later sections, it will be convenient to use the term "labeled forest" to refer more 
generally to any forest whose vertices are integers. 



We assume that the reader is familiar with the combinatorial interpretation of basic 
operations on exponential generating functions: addition, multiplication, composition, 
and differentiation, as described, for example in [T71 Chapter 5] or PQ. Rather than 
adapting the formal language of P, we take a more informal approach to the connec- 
tion between exponential generating functions and the objects that they count. Thus in 
deriving functional or differential equations for generating functions we will describe the 
decompositions that lead to these equations, but omit the straightforward details of how 
the equations follow from the decompositions. All of our exponential generating functions 
will be in the variable x, so we will usually omit it, and write, for example A instead of 
A(x). Since nearly all our generating functions are exponential, we will usually call them 
simply "generating functions"; when we use ordinary generating functions we will note 
this explicitly. 

There are two kinds of decompositions of labeled objects that are especially important: 
unordered and ordered decompositions. An unordered decomposition is a decomposition 
of a labeled object into an unordered collection of objects. An important example is the 
decomposition of a forest into its component trees. When there is an unordered decom- 
position of objects counted by a generating function / into objects counted by g, the 





(2.3) 



3. Exponential generating functions 
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generating functions are related by the "exponential formula" f = e 9 . Decompositions 
may depend on a total ordering of the labels. For example, we can decompose a permu- 
tation (considered as a linear arrangement) by cutting it before each left-right minimum, 
yielding a set of permutations each starting with its smallest element. (Thus the permu- 
tation 4753126 is decomposed into {475, 3, 126}.) The permutation can be reconstructed 
from its constituents by arranging them in decreasing order of their first elements. The 
generating function identity in this example is 

J>!- = exp(]T(n -1)!- 

n=0 ' V n=l 

An ordered decomposition is a decomposition into a sequence of objects. If objects 
counted by / have an ordered decomposition into objects counted by h then / and h are 
related by / = 1/(1 — h). The simplest example is the decomposition of a permutation, 
considered as a linear arrangement, into the sequence of its entries; here g = x and 
/ = 1/(1 — x). Another important example is the decomposition of an ordered forest into 
its constituent trees. 

If an ordered decomposition for a class of labeled objects, then an unordered decompo- 
sition must also exist: if / = 1/(1 — h) then / = exp(log(l — h)^ 1 ) and we can interpret 
this by replacing x with h in the combinatorial interpretation given above for (|3.1|) . We 
will see an example of this kind of decomposition in Section |SJ 

4. Trees and forests 

We introduce here the types of trees and forests that we will count by various parameters 
in later sections. Most of this material is well known, but it is convenient to clarify the 
definitions and state the known formulas that we will generalize later. 

Let A be the generating function for labeled forests. Then since a tree consists of a 
root together with the forest of proper descendants of the root, the generating function 
for trees is xA, and since a forest is a set of trees, the generating function for forests is 
e xA . Thus A satisfies the functional equation 

A = e xA . (4.1) 

It is well known that the solution of (j4.1|) is 

oo n 
n=0 

and that more generally, 

oo n 

A c = J2c(n + c) n - 1 ^ J . (4.2) 

n=0 U ' 

Equivalent formulas can be found in Sections 1.2.5 and 1.2.7], jTTl Proposition 5.4.4] 
and [13, p. 97]. Since A c = e cXogA = e cxA , the coefficient of c l in A c is the generating 
function for forests of i trees. 

We say that a vertex of a tree has degree d if it has d children. A fc-ary tree is an 
ordered tree in which every vertex has degree k or 0. The ordinary generating function D 




A REFINEMENT OF CAYLEY'S FORMULA 



6 



for unlabeled k-ary trees in which the coefficient of x n is the number of fc-ary trees with 
n vertices of degree k (and thus 1 + (k — l)n vertices of degree 0) satisfies 

D = l + xD k , (4.3) 

and it is well known that 

D* = f:- r ?-( kn + C )ar. (4.4) 
^ kn + c V n J 

n=0 v 7 

See, for example, [HI Exercise 2.7.1], [17, Proposition 6.2.2], and P pp. 200-201]. In 
computing hook lengths of fc-ary trees, we will ignore vertices of degree 0. Thus the hook 
length of the root of the ternary tree in Figure Q is 6. 

For our purposes a labeled fc-ary tree is a fc-ary tree in which only the vertices of degree 
k are labeled. Figure Q shows a labeled ternary tree. The exponential generating function 




Figure 1. A labeled ternary tree 

for labeled fc-ary trees is the same, as a power series, as the ordinary generating function 
for unlabeled fc-ary trees, but as an exponential generating function we would rewrite 
(jOjl as 

D c = ^c{kn + c- l)(Jfen + c- 2) •••((*- l)n + c+ 1)^ 

n=0 n ' 

The generating function E for ordered trees (by the total number of vertices) satisfies 
E = x/(l — E). As in the case of fc-ary trees this formula holds for both the ordinary 
generating function, which counts unlabeled ordered trees, and for the exponential gener- 
ating function, which counts labeled ordered trees. The generating function F for ordered 
forests (in either the unlabeled or labeled version) satisfies F = 1/(1 — E) and E = xF, 
and thus F = 1/(1 - xF). 

The formula F = 1/(1 — xF) is easily transformed into F = 1 + xF 2 , so, as is well 
known, the generating function for ordered forests is the same as that for binary trees. 
In order to generalize this to /c-ary trees, we introduce k- colored ordered forests with 
generating function F satisfying 

1 , , 
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A ^-colored ordered forest is a forest of ordered trees in which the edges are colored in 
colors 1, 2, . . . , k with the property that among the children of any vertex, those of color 
1 come first, then those of color 2, and so on. (But not every color must appear.) For 




Figure 2. A 2-colored forest 

example, Figure El shows a 2-colored ordered forest in which color 1 is represented by solid 
lines and color 2 by dotted lines. Since ()4.5|) is equivalent to F — 1 + xF k+1 , fc-colored 
forests are equinumerous with (k + l)-ary trees, and the well-known bijection from forests 
of ordered trees to binary trees (see, e.g., Stanton and White [T8J p. 61]) generalizes easily 
to this situation. 



5. The generating function for P n (a,b,c) 
Our main objects of study are the polynomials P n (a, b, c) defined by 

n-1 

P n (a, b, c) = c J^(«a + (n — i)b + c). (5.1) 
i=i 

Note that P n (a, b, c) is symmetric in a and b. It is also homogeneous of degree of n in a, 
b, and c, so with no loss of information we could set any of the variables to one (or to any 
other nonzero value). 

We denote by Q n (a, b) the coefficient of c in P n (a, b, c), so 

n-1 

QJa, b) = c _1 P n (a, b, c) = TT(za + (n - i)b). 

c =o ±J - 

i=l 

The polynomials P n (a,b,c) generalize the counting formulas of the previous section, 
since 

cin + c^-^Pnihhc) 

and 

c fkn + 
kn + c\ n 

Since the polynomial ()5.2)1 has two free parameters, by rescaling we can obtain P n (a, b, c) 
and we find that for a ^ b, we have 

P n (a, b, c)/n\ = {b- aYj^— f 5 " + C ) , (5.3) 

bn + c V n J 



n\=P n (k-l,k,c). (5.2) 
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where b = b/(b — a) and c = cj ib — a). 
It follows from Q, (JOJ) and flOJ) that 



n=0 

where C7 satisfies 

G = l + {b-a)xG b/{b - a \ (5.4) 

It does not seem to be easy to obtain a combinatorial interpretation for P n (a, b, c) directly 
from (|5.4jl . Instead, we derive a differential equation whose combinatorial meaning is 
clearer. Let H = G 1/{b ~ a \ so that 



Y j P n (a 1 b 1 c)^- = H c . (5.5) 



n=0 



We claim that i7 satisfies the differential equation 

#' = F a+1 + bxH a H'. (5.6) 

To see this, (15. 4 J) gives 

= 1 + (6 - a)a;F 6 . (5.7) 

Differentiating with respect to x and dividing both sides by (b — a)H b ~ a ~ 1 gives (|5.6|) . 
(Although this derivation is not valid for a = b, a limit argument shows that (|5.6J) does 
hold in this case.) Conversely, (|5.6|) . together with the initial value H(0) = 1, implies 
(f5~7j) . and thus (l53|) . 

We note that (|5.7|) can be written in the more symmetric and combinatorially suggestive 
form 

H b (l + axH a ) = H a (l + bxH b ); (5.8) 

however, we have not found a combinatorial interpretation to ()5.8|) . 

We now give a completely self-contained proof that ()5.6|) implies (|5.5|) . For our appli- 
cations, it is convenient to add another redundant variable to the differential equation. 

Theorem 5.1. Let g = g(x) be the solution of the differential equation 

g> = U g b+1 + axg b g' (5.9) 

with g(0) = 1, where the prime denotes the derivative with respect to x. Then 



g c = Y j P n (aM,cu)^. (5.10) 

^— < n.< 

and 

X n 

logg =}uQ n (a,bu) — . (5.11) 



71=0 



n=l 
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Proof. Let P n (c) be the coefficient of x n /n\ in g c , where the dependence of P n (c) on u, a, 
and b is implicit. From (|5.9|) we obtain 



(g c )' = cug b+c + acV +c ~y = <V +C + ^(<7 &+c )' (5-12) 



Equating coefficients of x n /n\ in (|5.12jl gives 

— — TIQC — 

Pn+l(c) = CuP n (b + c) + T—Pn(p + c) 

+ c 

c(na + bu + cm) - . 

= — P n (b + c . 

+ c 

Since -Po( c ) = 1, we get 

P n (c) = cu(a + (n — l)bu + cu)(2a + (n — 2)5w + cm) • • • ((n — l)a + few + cm). 
The formula for log g follows by taking the coefficient of c in g c . □ 

6. Counting forests by proper vertices 

As noted earlier, P n (l, 1, 1) = (n+ l) n_1 is the number of forests on [n]. Thus we might 
hope that P n (a,b,c) counts these forests according to some parameters. Theorem 16.11 
shows that this is the case. 

For a forest F, let prop F be the number of proper vertices and let tree F be the number 
of trees in F. We give two proofs of the following result, one using the differential equation 
for the generating function for P n (a,b,c) and one which is more combinatorial, though 
not completely bijective. 

Theorem 6.1. We have 

a n - piopF b piopF ~ tieeF c tmeF = P n {a, b, c), (6.1) 

F 

where the sum is over all forests F on [n] . 

First proof. Let / be the generating function for forests by the number of proper vertices; 
i.e., 



n=0 F n ' 

where F runs over all forests on [n\. 

Let J be the generating function for trees weighted by proper vertices. Then I = e J . 
Next we will show that J' — ul + xV . Let T be a tree on {0, 1, . . . , n}, interpreted as 
a labeled object with label set [n]. Then T consists of a root together with the forest F 
obtained by deleting the root. If the root of T is then T has one more proper vertex 
than F . Thus the contribution to J' from trees in which the root is is ul. If the root 
is not 0, then T has an improper root with a label in [n], and appears in F. Thus the 
contribution to J' from trees in which the root is not is xV . 

Then 

l> = ( e J )' = e J f = I(ul + xF) = ul 2 + xll', 



A REFINEMENT OF CAYLEY'S FORMULA 



10 



so by Theorem 15.11 



oo 



,71 



I = J]P ft (l,u,u) 



x 



n! 



n=0 



as shown by Seo Corollary 8]. 

The number of proper vertices of a forest is the same as the number of proper vertices 
in all its constituent trees, so I c = exp(cJ) counts forests by proper vertices in which 
every tree is weighted by c, which gives 



Using the fact that P n (a, b, c) is a homogeneous polynomial of degree n in a, b, and c, we 



We can interpret (jfj.ljl as counting forests on [n] in which each vertex is assigned a 
weight. We will call a vertex weakly proper if it is proper, but not the smallest vertex in 
its tree and minimal if it is the smallest vertex in its tree. Then (jS3| counts forests in 
which every improper vertex is weighted a, every weakly proper vertex is weighted b, and 
every minimal vertex is weighted c. 

Second proof of Theorem \6.1\ Let p n (a,b,c) count forests F on [n] in which every im- 
proper vertex is weighted a, every weakly proper vertex is weighted b, and every minimal 
vertex in a component of F is weighted c. Then c~ 1 p n (a, b, c) counts forests with the same 
weight on each vertex except that the smallest vertex of F is weighted by 1. Suppose that 
a forest F on [n] is weighted in this way. If 1 is a root of F, delete the vertex 1 of F. Then 
we obtain a forest on {2, 3, . . . , n} in which the minimal vertices are weighted either b or c. 
(See (a) of FigureEl) Thus these forests are counted by p n _i(a, b, b+c). If 1 is not a root of 




(6.2) 



F 



can rescale (|6.2|) to get (jbMj) . 



□ 



(a) 




(b) 




Figure 3. A weight preserving decomposition of forests 



F, interchange the label 1 and its parent i and delete the vertex i. Then we have a forest 
on [n] \ {%} such that the smallest vertex 1 is weighted 1 and the other minimal vertices 
are weighted either b or c. Moreover, there are n — 1 possibilities for the parent % of 1 and 
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it is weighted a. Thus these forests are counted by (n — 1) a • (b + c) 1 p n _i(a, b, b + c). 
(See (b) of Figure El) Combining two cases yields 

c -1 pn(a, b, c) = p n -i{a, b,b + c) + (n - 1) a (b + c) _ Vn-i(a, b, b + c), 

with the initial condition c~ l pi(a,b,c) = 1. It follows easily that 

p n (a,b,c) = P n (a,b,c). 

□ 

The symmetry P n (a, b, c) = P n (b, a, c) implies that among forests of j trees, the number 
with i improper vertices is equal to the number with i weakly proper vertices. (It is clear 
that the case j = 1 of this symmetry implies the general case.) We do not have a bijective 
proof of this symmetry. 

It is instructive to look at the case i = of this symmetry, which can be explained 
by known bijections. By the homogeneity of the polynomials P n (a, b, c), the symmetry is 
equivalent to the identity P n (0, 1, c) = P n (l, 0, c) = c(c + 1) • • • (c + n — 1), in which the 
coefficients are unsigned Stirling numbers of the first kind. First, P n (0, 1, c) counts forests 
on [n] with n proper vertices. These are increasing forests, and there is a well known 
bijection p. 25] that takes an increasing forest of j trees to a permutation with j 
left-right minima. Next, P n (l, 0, c) counts forests in which the number of proper vertices 
equals the number of trees. Since every tree has at least one leaf, and every leaf is proper, 
in these forests every tree has a single leaf, which is the least vertex of the tree. If we 
read the trees from leaf to root, we get a set of j permutations, each of which starts with 
its least element, and as described in Section 01 this set of permutations can be arranged 
to form a permutation with j left-right minima. 

A different symmetry of forests that has a very similar special case is described in 
(See also [TO].) 

Since the proper vertices of a forest are the same as the proper vertices of its component 
trees, (|5.11|) gives a corresponding result for trees; 

Corollary 6.2. We have 

J- a n-propT 6 propT-l = 
T 

where T runs over all trees on [n] . □ 

Applying Lemma 12.41 gives a hook-length formula for forests: 
Corollary 6.3. 




= P n {l,l + a,c{l + a)), 



where F runs over all forests on [n] . □ 
In particular, taking a = 1 gives 

i_ E c «- n (, + 1 ) = l p. (1 , 2 , 2c) = _£_ + ^ , 

n! \ n(v) J n! n + c\ n I 

FeF ve[n] v w/ v 7 
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which for c = 1 is the Catalan number C n+ \. 

7. k-ARY TREES 

In this section we give an analogue of Theorem 16. II for k-ary trees. We first count k-ary 
trees by proper vertices, obtaining a result of Seo jTU Corollary 7] . 

Theorem 7.1. We have 

^ u pio » T = P n (k,(k-l)u,u), (7.1) 

T 

where the sum is over all k-ary trees T on [n]. 

Proof. Let J be the generating function for ^ T u propT as in the statement of the theo- 
rem. By reasoning similar to that used in the proof of Theorem 16.11 we will obtain the 
differential equation 

J' = uJ k + jfcxJ* -1 J'. (7.2) 
We consider k-ary trees on the vertex set {0, 1, . . . , n} and look at the location of vertex 0. 
The term uJ k in (|7.2|) corresponds to the case in which is the root of the tree. If is not 
the root, the tree can be decomposed into a root R (the factor x), the subtree containing 
rooted at a child of R (the factor J') and a (k — l)-tuple of the other subtrees rooted at 
the children of R (the factor J fc_1 ). The factor k is the number of positions in which the 
subtree containing can lie in relation to the other subtrees. 
So by Theorem 15.11 

oo n 

J = J2P n {k,{k-l)u,u)^j. (7.3) 

n=0 

□ 

Applying Lemma 12.21 gives the following hook- length formula for fc-ary trees, which is 
equivalent to Corollary 3.11 of Du and Liu 0. The case k — 2, a — 1 is equivalent to 
Postnikov's formula (jl.3|) . 

Corollary 7.2. We have 




= P n (k, (k - 1)(1 + a), 1 + a) 



where T runs over all unlabeled k-ary trees on n vertices. 

We can generalize Theorem 17. II to include a parameter that corresponds to the number 
of trees in a forest, and thereby find a result analogous to Theorem 16.11 In order to do 
this, we must find an unordered decomposition of k-ary trees that is compatible with 
proper vertices; i.e., no improper vertex of a k-ary tree becomes proper when restricted 
to its component. To define this decomposition we first specify a particular path in a 
fc-ary tree T from the root to a leaf. The rule is that we always go from a labeled vertex 
v to the first child of v unless v is improper and the smallest descendant of v is also a 
descendant of the first child of v. In this case we go to the second child of v. When 
we reach a leaf, we stop. We call this path the decomposing path of T. For example, 
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Figure |U shows a binary tree with its decomposing path. The unordered decomposition 




Figure 4. A binary tree and its decomposing path 

of the fc-ary T is obtained by removing each edge on its decomposing path (except the 
last edge ending at a leaf) and replacing each lost child with a leaf. We denote by compT 
the number components in the decomposition of T, which is the same as the number of 
labeled vertices in the decomposing path. The unordered decomposition of the tree of 
Figure |U is shown in Figure El 




Figure 5. Decomposition of the tree of Figure E] 
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Theorem 7.3. For k > 1 we have 

^^n-propr^ropT-compT^compT = p^^ ^ _ ^ w ) (7.4) 
T 

where the sum is over all k-ary trees T on [n]. 
Proof. We first prove 

M PropT c compT = p^ ^ _ ^ ^ (75) 

T 

By ()5.10|) and Theorem 17.11 to prove ()7.5|) it is sufficient to show that the generating 
function for the left side is J c , where J is as in the proof of Theorem l7.ll This follows from 
a verification that the unordered decomposition for a fc-ary tree has several properties. 
Let the decomposition of a A;-ary tree T be {Ti, . . . , T m }. We must show that comp Tj = 1 
for each i, that T can be recovered from {Ti, . . . ,T m }, and that propT = ^propTj. 
Moreover we must also show that any such set of trees {Ti, . . . , T m } with disjoint labels 
comes from the decomposition of some A;-ary tree. 

The components Tj are characterized by the property that either the first child of Tj 
is a leaf, or the root of Tj is improper, its smallest descendant is a descendant of its first 
child, and its second child is a leaf. This implies that compTj = 1. We can reconstruct 
the order of the components since the first component Ti will contain the smallest vertex 
of T, the next component T2 will contain the smallest vertex of T not in Ti, and so on. To 
reconstruct T, let rj denote the root of tree T,. Then in T, r i+1 must have been either the 
first or second child of rf. r i+1 was the first child of rj in T if and only if the first child of 
Tj is a leaf. The decomposition preserves proper and improper vertices since the smallest 
descendant of any improper vertex v on the decomposing path remains a descendant of 
v in its component (and descendants of vertices not on the decomposing path are not 
changed). The last assertion is clear. 

The homogeneity of P n (a,b,c), together with ()7.5|) . implies that 

^^-propT^propT^ompT = ^ _ ^ ^ 

T 

Replacing c with w/u gives (|7.4j) . □ 

In should be pointed out that Theorem 17 .31 is not valid for unary trees. However, there is 
a different (and simpler) decomposition for unary trees that gives an analogue of Theorem 
17.31 in this case. 

In contrast to Corollary 16.31 we cannot use Theorem 17.31 to add a parameter for com- 
ponents to Corollary 17.21 since the number of components of a £;-ary tree is not an iso- 
morphism invariant. 

8. Ordered forests 

Next we count fc-colored ordered forests by proper vertices. The case k = 1 of the next 
theorem was given by Seo [HI equation (14)]. 
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Theorem 8.1. We have 

J2u pIopF = P n {k,{k + l)u,u), (8.1) 

F 

where the sum is over all k-colored ordered forests F on [n], and for n > 1, 

J2u piopT = -P n {k,{k + l)u,-u), (8.2) 

T 

where the sum is over all k-colored ordered trees T on [n]. 

Proof. Let K be the generating function for fc-colored ordered forests, where a forest F is 
weighted u propF . Let L be the analogous generating function for &;-colored ordered trees, 
so K = 1/(1 - L). We claim that 

U = u K k + kxK k ~ l K'. (8.3) 

By similar reasoning to that used before, we see that in ()8.3|) the term uK counts k- 
colored ordered trees on {0, 1, . . . , n} in which is the root, and for each i from 1 to k, 
xK k ~ 1 K' is the generating function for forests in which is in a forest joined to the root 
by an edge of color i. It follows from ()8.3)1 and K — 1/(1 — L) that 

K' = uK k+2 + kxK k+1 K'. 

So by Theorem 15.11 we have 

oo n 

K = J2P n (k,(k + l)u,u)^j, (8.4) 

n=0 

proving the first formula. 

For the second formula, we note that 

L = l-K' 1 , 

and a formula for the coefficients of K~ x follows from ()8.4j) and (|5.1()j) . 

□ 

Applying Lemma [2.51 to Theorem 18.31 gives hook length formulas for fc-colored ordered 
forests and trees. A result equivalent to the case k — 1 of ()8.5j) was given by Du and Liu 
Theorem 4.14]. 

Corollary 8.2. We have 

nl Yl II ( 1 + rn ) = p ^ ( k + + «). 1 + «)> ( 8 - 5 ) 

where the sum is over all unlabeled k-colored ordered forests F on n vertices, and 
n\J2 II (l + j^)=-Pn(k,(k + l)(l + a),-(l + a)), 

T veV(T) ^ ^ '' 

where the sum is over all unlabeled k-colored ordered trees F on n vertices. 
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From the ordered decomposition of /c-colored ordered forests into fc-colored ordered 
trees, we get an unordered decomposition as described in Section El One way to do this is 
to look at the sequence of roots, and cut the forest after each left-right minimum in this 
sequence. (Instead of using the root of each tree for comparison, we could use the smallest 
or largest element of each tree.) Thus we decompose each fc-colored ordered forest into 
components which are /c-colored ordered forests in which the first root is the smallest. For 
a fc-colored ordered forest F, let comp F be the number of components in this sense; that 
is, the number of left-right minima in the sequence of roots. Then since 



ii 



U,CU)—r, 



X 



oo 

K c = Y, P n(k, (k+l] 

71=0 

we have 

P n (k, (k + l)u,cu) = J2u propF c compF . 

F 

By the same reasoning as in Theorem I7.3| this leads to the analogous formula for 
/c-colored ordered trees: 

Theorem 8.3. We have 

^^-propF^propF-compF^compF = ^ + 

F 

where the sum is over all k-colored ordered forests on [n]. 

As noted in Section there is a simple bijection from ^-colored ordered forests to 
(k + l)-ary trees, but this bijection does not preserve proper vertices. However, we do 
have a relation between proper vertices of these forests and trees: 

Corollary 8.4. We have 

^"-prop T u P r °P r-comp T^comp T _ ^n-prop -F^prop F-comp F^comp F 

T F 

where T runs over (k + l)-ary trees on [n] and F runs over k-colored ordered forests on 
[n\. 

Proof By Theorem 18.31 the sum on F is equal to P n (ku, (k+ l)v,w). In (|7.4|) we replace 
k by k + 1, switch u and v, and use P n (a, b, c) = P n (b, a, c) to show that the sum on T is 
also equal to P n (ku, (k + l)v, w). □ 

9. Descents in trees 

In this section, we describe another forest interpretation, due to Egecioglu and Remmel 
[0], for P n (a, b, c). If vertex i is the parent of vertex j in a tree, we call the edge {i, j} an 
ascent if i < j and a descent if i > j, and for a forest F we write asc F for the number of 
ascents of F and des F for the number of descents of F. (A different notion of descents 
in trees was studied in [7j.) We use the differential equation approach to count forests by 
ascents and descents; Egecioglu and Remmel [H] gave a bijective proof of a more general 
result. 
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Theorem 9.1. We have 

F 

where the sum is over all forests F on [n] . 

Proof. Since the ascents and descents of a forest are the same as those of its constituent 
trees, it is sufficient to prove the case c = 1 of (J9.1|) . Let M be the generating function for 
the polynomials a desF b ascF . We shall show that M satisfies the differential equation 
We shall show that M satisfies the differential equation 

M' = M b+1 + axM b M' ) 

which by Theorem 15. II implies the result. 

Let N = logM, so that M counts trees by descents and ascents. Now let F be a forest 
on the vertex set {0, 1, . . . ,n}. We decompose F into forests F x and F 2 that together 
contain all the vertices of F other than 0. We let Fi be the forest of proper descendants 
of 0, and we let F 2 be the forest of all vertices that are not descendants of 0. In order to 
reconstruct F from F\ and F 2 we consider two cases. First, suppose that is a root of 
F. Then F is determined by F\ and F 2 and the ascents and descents of F are the same 
as those of Fi and F 2 , except that F has one additional ascent for each edge joining 
to one of its children. The contribution to M' from forests in which is a root is thus 
e bN M = M b M = \I h: . 

If is not a root of F, then to reconstruct F from F\ and F 2 it is sufficient to know 
the parent of in F; thus F is determined by Fx, F 2 , and the choice of a vertex in F 2 . 
All of the ascents and descents of Fi and F 2 will be ascents and descents of F, and as in 
the first case, F will have an extra ascent for every child of 0. In addition, F will have 
an extra descent for the edge joining to its parent. The generating function for forests 
with a marked vertex is xM', so the contribution to M' from forests in which is not a 
root is aM b ■ xM'. □ 

10. Parking functions 

A parking function on an n-set K is a function / : K — > [n + 1] satisfying a certain 
property, described below. We think of the elements of K as cars and the elements of 
[n + 1] as parking spaces and we think of / as a function that assigns to each car its 
favorite parking space. The cars arrive in some order at their favorite parking spaces. If 
space f(z) is not already occupied when car z arrives there, then car z parks in space 
f(z). More generally, if there is some available space s, with f(z) < s < n + 1, then car z 
parks in the first (i.e., smallest) such space. Otherwise, car z does not park at all. If all 
cars are able to park, then we call / a parking function. Although cars are not allowed to 
park in space n + 1 , we will see that the existence of this space is useful. It is well known 
(and easy to see) that whether all cars are able to park or not does not depend on their 
arrival order. However, we will assume that the cars are totally ordered and that they 
arrive in increasing order. 

It is useful to represent the parking function / : K — > [n + 1] by the ordered partition 
(Bi, . . . , B n+ i) of K in which Bi = f~ l (i)\ i.e., Bi is the set of cars that prefer to park 
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in space i. A necessary and sufficient condition for / to be a parking function is that for 
each i, \B±\ + • • • + \Bi\ > i. 

It is well known that the number of parking functions on [n] is (n + l) n ~ l [17, p. 140]. 
So the exponential generating function for parking functions is 

A = Y(n + l) n - 1 ^-. 
^ n! 

n=0 

We consider a slight generalization of parking functions. Instead of taking the set of 
parking spaces for n cars to be [n + 1] , we take it to be [n + c] , for some positive integer c 
(but we keep the number of cars at n). Then we call / a c-parking function if all cars are 
able to park using the same parking procedure as before, except that cars may now park 
in spaces 1 to n + c — 1. An alternative characterization of c-parking functions is that 
with Bi = / _1 (i), we have \Bi\ + ■ • • + \Bi\ > i — c+ 1. It is clear that if all cars park, 
they will leave exactly c empty spaces, one of which will be the last space, n + c. Thus 
if we represent a c-parking function by an ordered partition, then cutting it after every 
empty parking space decomposes it into a concatenation of c ordinary parking functions. 
Therefore the generating function for c-parking functions is 

A c = Y ' c(n + c) n ~ 1 ^-. 
/ — ' n\ 

n=0 

Further properties of c-parking functions can be found in Yan [TU] . 

If / is a parking function K — > [n + 1] then we call a car z G K lucky if it succeeds 
in parking in its preferred space. Let lucky / denote the number of lucky cars associated 
to/. 

Theorem 10.1. We have 

^n luck ^ = P n (l,n, M ), 
/ 

where the sum is over all parking functions f : [n] — > [n + 1] . 

Proof. Let q n {u) = ^ M lucky ^, where the sum is over all parking functions / on [n] and 

let Q = En 

D = oQ.n{u)x n /n\. We will show that Q satisfies the differential equation 

Q< = U Q 2 + xQ'Q, (10.1) 

which by Theorem 15 . II implies the result. To do this we evaluate Q' by counting parking 
functions / : [n + 1] — > [n + 2] . The restriction /~ of such a parking function to [n] is 
a 2-parking function, which leaves two empty spaces, e + 1 and n + 2, for some e with 
< e < n. If n + 1 is lucky in /, then f(n + 1) must be e + 1, and the contribution to 
Q' from this case is uQ 2 . If n + 1 is not lucky in / then f(n + 1) is 1, 2, . . . , or e. In 
this case /~ decomposes into a parking function on e cars, with one of the first e spaces 
"marked," followed by a parking function onn-e cars. The generating function for 
parking functions with a marked space is xQ' so the contribution to Q' from this case is 
xQ'Q. Adding together these two contributions gives (jlO.lj) . □ 
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Since the number of lucky cars in a c-parking function is the total number of lucky cars 
among its constituents, the generating function for c-parking functions by lucky cars is 
Q c , and we have the following result: 

Corollary 10.2. For any positive integer c, we have 

5> 1UCky/ = Pn(l,«,C M ), 
/ 

where the sum is over all c-parking functions f : [n] — > [n + c] . 

There is an ordered decomposition for parking functions that gives us a result analogous 
to equation ()8.2|) . We modify the ordered partition representation of a parking function 
/ : K — > [n + 1], where \K\ = n, by removing the empty block B n+ \. Let us call the result 
the reduced ordered partition corresponding to /. Then / is called a prime parking function 
if its reduced ordered partition cannot be decomposed as a nontrivial concatenation of 
reduced ordered partitions for parking functions; equivalently, if (B 1 ,B 2 , . . . , B n ) is the 
reduced ordered partition of /, then \B X \ + \B 2 \ + • ■ ■ + \Bi\ > i for 1 < i < n. Then every 
parking function can be decomposed uniquely into prime parking functions. 

Corollary 10.3. For n > 1, 

J> luck ^ = -P n {l,u,-u), 
f 

where the sum is over all prime parking functions f on [n] . 

Proof. Let R be the generating for prime parking functions / weighted by w lucky ^. Then 
since the number of lucky cars in a parking function is compatible with the decomposition 
into prime parking functions, we have Q — 1/(1 — R), with Q as in the proof of Theorem 
ll(J. II Thus R = 1 — and the result follows from Theorem 1 1 (J . 1 1 and (J5.1U)) . □ 
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